Interactive Exploration of Large Dendrograms with Prototypes

نویسندگان

چکیده

Hierarchical clustering is one of the standard methods taught for identifying and exploring underlying structures that may be present within a dataset. Students are shown examples in which dendrogram, visual representation hierarchical clustering, reveals clear structure. However, practice, data analysts today frequently encounter datasets whose large scale undermines usefulness dendrogram as visualization tool. Densely packed branches obscure structure, overlapping labels impossible to read. In this article we new workflow performing via R package called protoshiny aims restore its former role being an effective versatile Our proposal leverages interactivity combined with ability label internal nodes representative point (called prototype). After presenting workflow, provide three case studies demonstrate utility.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Understanding Hierarchical Clustering Results by Interactive Exploration of Dendrograms: A Case Study with Genomic Microarray Data

Hierarchical clustering is widely used to find patterns in multi-dimensional datasets, especially for genomic microarray data. Finding groups of genes with similar expression patterns can lead to better understanding of the functions of genes. Early software tools produced only printed results, while newer ones enabled some online exploration. We describe four general techniques that could be u...

متن کامل

Interactive exploration of large filesystems

Secure management of file systems of large organizations can present significant challenges to system administrators, in terms of the number of users, shared access to parts of the file system for supporting large software projects, and securing and monitoring critical parts of the file system from intruders. We present interactive visualization tools for monitoring and viewing the complex acce...

متن کامل

Interactive exploration of large image repositories

Nowadays, the majority of people possess some form of digital camera to use in their everyday lives. Devices range from relatively low quality web cameras, to medium range cameras integrated into mobile devices, to higher quality cameras aimed at the average user, on to high end cameras used by professional photographers. Affordability of devices and storage media coupled with increased capabil...

متن کامل

GraphVista: Interactive Exploration Of Large Graphs

The potential to gain business insights from graph-structured data through graph analytics is increasingly attracting companies from a variety of industries, ranging from web companies to traditional enterprise businesses. To analyze a graph, a user often executes isolated graph queries using a dedicated interface—a procedural graph programming interface or a declarative graph query language. T...

متن کامل

Interactive Exploration on Large Genomic Datasets.

The prevalence of large genomics datasets has made the the need to explore this data more important. Large sequencing projects like the 1000 Genomes Project [1], which reconstructed the genomes of 2,504 individuals sampled from 26 populations, have produced over 200TB of publically available data. Meanwhile, existing genomic visualization tools have been unable to scale with the growing amount ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: The American Statistician

سال: 2022

ISSN: ['0003-1305', '1537-2731']

DOI: https://doi.org/10.1080/00031305.2022.2087734